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CLAIMS 

What is claimed is: 

5 

1 . A system for managing a high availability cluster during failover, comprising: 

a primary node running a middleware stack for supporting web applications, wherein a 
plurality of layers of said middleware stack are active, wherein said primary node is assigned a 
10 virtual IP address to which requests are directed; 

a secondary node running a redundant middleware stack mirroring said plurality of layers 
of said middleware stack of said primary node, wherein a first selection of said plurality of layers 
of said redundant middleware stack are active and a second selection of said plurality of layers of 
1 5 said redundant middleware stack are in standby; 

a data replication partition shared between said primary node and said secondary node 
with data accessible to a selection of said plurality of layers of said active middleware stack, 
wherein said selection of said plurality of layers of said active middleware stack correspond to 
20 said second selection of said plurality of layers of said redundant middleware stack in standby; 
and 
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a heartbeat monitor for detecting a failure at said primary node, wherein in response to 
detecting said failure at said primary node, said heartbeat monitor transfers said virtual IP address 
from said primary node to said secondary node, said heartbeat monitor turn off power to said 
5 primary node, said heartbeat monitor remounts said data replication partition for access by said 
secondary node, said heartbeat monitor activates said second selection of said plurality of layers 
of said redundant middleware stack requiring access to said data within said data replication 
partition. 

10 2. The system according to claim 1 for managing a high availability cluster, wherein said 
middleware stack and said redundant middleware stack support J2EE compliant web 
applications. 

3. The system according to claim 1 for managing a high availability cluster, wherein said 
1 5 middleware stack and said redundant middleware stack comprise said plurality of layers 

comprising a selection from among a load balancer, a web server, a web application server, a 
messaging server, a monitoring server, and a database control server. 

4. The system according to claim 1 for managing a high availability cluster, wherein said 
20 standby selection of said plurality of layers of said redundant middleware comprise at least one 

from among a messaging server and a database control server. 
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5. The system according to claim 1 for managing a high availability cluster, wherein a web 
application server active on said secondary node before failover points to a virtual IP address of a 
messaging server and a database control server from said middleware stack of said primary node. 

5 

6. The system according to claim 5 for managing a high availability cluster, wherein 
responsive to detecting said failure at said primary node, said heartbeat monitor takes over said 
virtual IP address and configures said second selection of said plurality of layers of said 
redundant middleware stack with said virtual EP address on startup. 

10 

7. The system according to claim 1 for managing a high availability cluster, wherein a 
message queue and a relational database are launched in said data replication partition. 

8. The system according to claim i for managing a high availability cluster, wherein said 
1 5 data replication partition is a Data Replication Block Device. 
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9. The system according to claim 1 for managing a high availability cluster, further 
comprising: 

a first power supply dedicated to said primary node; 

5 

a second power supply dedicated to said secondary node; 

first connection means for connecting said first power supply and said secondary node; 

10 second connection means for connecting said second power supply and said primary 

node; 

means, responsive to detecting said failure at said primary node, for sending a shutoff 
command via said first connection means for turn off power to said first power supply. 

15 

10. The system according to claim 1 for managing a high availability cluster, further 
comprising: 

a service monitoring daemon for monitoring a status of a plurality of services provided by 
20 said middleware stack, wherein responsive to detecting a failure of a particular service from 

among said plurality of services, said service monitoring daemon restarts another instance of said 
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particular service at said primary node with a particular process identifier. 

1 1 . The system according to claim 1 for managing a high availability cluster, further 
comprising: 

5 

a cross-over cable connected between said primary node and said secondary node via 
network adapters; and 

said heartbeat monitor for sending heartbeat requests via said cross-over cable, wherein in 
10 response to not receive a return of a heartbeat request, said heartbeat monitor detects a failure in 
the non-responsive node. 

12. The system according to claim 1 for managing a high availability cluster, further 
comprising: 

15 

said middleware stack running in said primary node atop a Linux compliant operating 
system layer. 
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13. A system for managing middleware during failo ver in a high availability cluster, 
comprising: 

a primary node comprising an active load balancing controller assigned to a first virtual 
5 IP address, an active HTTP server, an active web application server, an active management queue 
controller, and an active database controller each monitored by a first heartbeat; 



a secondary node comprising a standby redundant load balancing controller, an active 
redundant HTTP server, an active redundant web application server, a standby redundant 
1 0 management queue controller, and a standby redundant database controller each monitored by a 
second heartbeat; 



a drbd data storage partition shared between said primary node and said secondary node, 
wherein a message queue and a database implemented by said active management queue 
1 5 controller and said active database controller of said primary node are activated at said drbd data 
storage partition, and wherein said drbd allows access to said active management queue 
controller and said active database controller configured at a second virtual IP address; and 



20 



a heartbeat monitor for detecting a failure at said primary node, wherein in response to 
detecting said failure at said primary node, said heartbeat monitor transfers said first virtual IP 
address from said active load balancing controller to said redundant load balancing controller and 
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activates said redundant load balancing controller, said heartbeat monitor calls STONITH to turn 
off power to said primary node, said heartbeat monitor activates and assigns said second virtual 
IP address to said redundant management queue controller and said redundant database 
controller, and said heartbeat monitor remounts said drbd partition for access by said redundant 
5 management queue controller and said redundant database controller. 

14. A system for managing J2EE compliant middleware in a high availability cluster during 
failover, comprising: 

10 a first node for running an active stack of J2EE compliant middleware; 



a second node for running a partially active and partially standby redundant stack of J2EE 
compliant middleware; 

4 * 
1 5 a data replication partition shared between said first node and said second node enabling 

access to persistent data resources to only said first node; and 

a heartbeat monitor for detecting a failure in said first node and transferring services 
provided by said first node to said second node, wherein said heartbeat monitor activates said 
20 partially standby redundant stack and remounts said data replication partition for access by said 
activated partially standby redundant stack. 
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15. A method for managing a high availability cluster during failover, comprising: 

controlling a primary node running a middleware stack for supporting web applications, 
5 wherein a plurality of layers of said middleware stack are active, wherein said primary node is 
assigned a virtual IP address to which requests are directed; 

controlling a secondary node running a redundant middleware stack mirroring said 
plurality of layers of said middleware stack of said primary node, wherein a first selection of said 
1 0 plurality of layers of said redundant middleware stack are active and a second selection of said 
plurality of layers of said redundant middleware stack are in standby; 

managing a data replication partition shared between said primary node and said 
secondary node with data accessible to a selection of said plurality of layers of said active 
1 5 middleware stack, wherein said selection of said plurality of layers of said active middleware 

stack correspond to said second selection of said plurality of layers of said redundant middleware 
stack in standby; and 

responsive to detecting a failure at said primary node, transferring said virtual IP address 
20 from said primary node to said secondary node, turning turn off power to said primary node, 
remounting said data replication partition for access by said secondary node, activating said 
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second selection of said plurality of layers of said redundant middleware stack which require 
access to said data within said data replication partition. 

1 6. The method according to claim 1 5 for managing a high availability cluster, further 
5 comprising: 

controlling said middleware stack and said redundant middleware stack to support J2EE 
compliant web applications. 

10 17. The method according to claim 1 5 for managing a high availability cluster, wherein said 
middleware stack and said redundant middleware stack comprise said plurality of layers 
comprising a selection from among a load balancer, a web server, a web application server, a 
messaging server, a monitoring server, and a database control server. 

15 18. The method according to claim 1 5 for managing a high availability cluster, wherein said 
standby selection of said plurality of layers of said redundant middleware comprise at least one 
from among a messaging server and a database control server. 



AUS920040005US1 



-43- 



19. The method according to claim 1 5 for managing a high availability cluster, further 
comprising: 

enabling a web application server active on said secondary node before failover points to 
5 access a messaging server and a database control server from said middleware stack of said 
primary node. 

20. The method according to claim 15 for managing a high availability cluster, further 
comprising: 

10 

launching a message queue and a relational database in said data replication partition. 

21 . The method according to claim 15 for managing a high availability cluster, further 
comprising: 

15 

responsive to detecting said failure at said primary node, for sending a shutoff command 
to turn off power to said primary node. 
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22. The method according to claim 1 5 for managing a high availability cluster, further 
comprising: 

monitoring a status of a plurality of services provided by said middleware stack; 

responsive to detecting a failure of a particular service from among said plurality of 
services, restarting another instance of said particular service at said primary node with a 
particular persistent identifier. 



23. The method according to claim 15 for managing a high availability cluster, further 
comprising: 

sending heartbeat requests between said primary node and said secondary node; and 

responsive to not receive a return of a heartbeat request, detecting a failure in the non- 
responsive node. 
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24. The method according to claim 15 for managing a high availability cluster, further 
comprising: 

running said middleware stack in said primary node atop a Linux compliant operating 
system layer. 

25. A computer program product, residing on a computer readable medium, for managing a 
high availability cluster during failover, comprising: 

means for controlling a primary node running a middleware stack for supporting web 
applications, wherein a plurality of layers of said middleware stack are active, wherein said 
primary node is assigned a virtual IP address to which requests are directed; 

means for controlling a secondary node running a redundant middleware stack mirroring 
said plurality of layers of said middleware stack of said primary node, wherein a first selection of 
said plurality of layers of said redundant middleware stack are active and a second selection of 
said plurality of layers of said redundant middleware stack are in standby; 

means for managing a data replication partition shared between said primary node and 
said secondary node with data accessible to a selection of said plurality of layers of said active 
middleware stack, wherein said selection of said plurality of layers of said active middleware 
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stack correspond to said second selection of said plurality of layers of said redundant middleware 
stack in standby; and 

means, responsive to detecting a failure at said primary node, transferring said virtual IP 
address from said primary node to said secondary node, for turning turn off power to said 
primary node, remounting said data replication partition for access by said secondary node, 
activating said second selection of said plurality of layers of said redundant middleware stack 
which require access to said data in said data replication partition. 

26. The computer program product according to claim 25 for managing a high availability 
cluster, furthering comprising: 

means for controlling said middleware stack and said redundant middleware stack to 
support J2EE compliant web applications. * 

27. The computer program product according to claim 25 for managing a high availability 
cluster, 

means for controlling said middleware stack and said redundant middleware stack 
comprising a selection from among a load balancer, a web server, a web application server, a 
messaging server, a monitoring server, and a database control server. 
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28. The computer program product according to claim 25 for managing a high availability 
cluster, further comprising: 

means for enabling a web application server active on said secondary node before failover 
5 points to access a messaging server and a database control server from said middleware stack of 
said primary node. 

29. The computer program product according to claim 25 for managing a high availability 
cluster, further comprising: . 

10 

means for launching a message queue and a relational database in said data replication 
partition. 

30. The computer program product according to claim 25 for managing a high availability 
1 5 cluster, further comprising: 



means, responsive to detecting said failure at said primary node, for sending a shutoff 
command to turn off power to said primary node. 
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3 1 . The computer program product according to claim 25 for managing a high availability 
cluster, further comprising: 

means for monitoring a status of a plurality of services provided by said middleware 

5 stack; 

means, responsive to detecting a failure of a particular service from among said plurality 
of services, for restarting another instance of said particular service at said primary node with a 
particular persistent identifier. 

10 

32. The computer program product according to claim 25 for managing a high availability 
cluster, further comprising: 

means for sending heartbeat requests between said primary node and said secondary 
15 node; and 

means, responsive to not receive a return of a heartbeat request, for detecting a failure in 
the non-responsive node. 
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33. The computer program product according to claim 25 for managing a high availability 
cluster, further comprising: 

means for running said middleware stack in said primary node atop a Linux compliant 
operating system layer. 



